O Group theoretic structures in the estimation of an 

(N : 

55 unknown unitary transformation 

^ ■ Giulio Chiribella 

. Perimeter Institute for Theoretical Physics, 31 Caroline Street North, Waterloo, Ontario, N2L 

2Y5, Canada 

Abstract. This paper presents a series of general results about the optimal estimation of 
physical transformations in a given symmetry group. In particular, it is shown how the different 
Ch ' symmetries of the problem determine different properties of the optimal estimation strategy. 

The paper also contains a discussion about the role of entanglement between the representation 
and multiplicity spaces and about the optimality of square-root measurements. 
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^ ■ 1. Introduction 

The estimation of an unknown unitary transformation in a given symmetry group is a general 
CN ' problem related to many stimulating topics, such as high-precision measurements [1], coherent 

^ ■ states and uncertainty relations [2], quantum clocks [3], quantum gyroscopes [4] and quantum 

reference frames [5]. The aim of this paper is to provide a synthetic account of the general 
theory of optimal estimation for symmetry groups, presenting new proofs and highlighting the 
underlying group theoretical and algebraic structures into play. 
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^ ' 1.1. Prologue: dense coding 



■ Let us start with a simple example. Suppose that we have at disposal a quantum system 

with two-dimensional Hilbert space ^ = and a black box that performs an unknown 
transformation Uij,i,j G {0,1}, defined on the set StiTi) of all density matrices on H as 
Ui,{p) := UijpU^yp G St(?^), where {Uij}lj ^0 are the unitary matrices 

Uoo = (J fj, Uoi = (J J) , Uw = (J _°^) Un = (J 0^ 

The value of the indices (i, j) is unknown to us and our goal is to find it out using the black box 
only once. Here the natural figure of merit is the minimization of the probability of error. 

Clearly, if we apply the black box to a state p £ St(T-L) there will always be an error: the 
four states {pij := Uij{p)}jj^Q will never be perfectly distinguishable (in a two-dimensional 
Hilbert space there cannot be more than two perfectly distinguishable states!). However, if we 
introduce a second system /C ~ 7^ and apply the unknown transformation to the projector on 
the entangled vector |^>) = -ij(|0)|0) + |1)|1)) £%®1C, where {|0),|1)} is the standard basis 

for C^, we obtain four states (the projectors on the vectors ■= {Uij iS" that are 

perfectly distinguishable: {^ij\^ki) = '^'^[UjjUki]/2 = SikSji. This remarkable fact was observed 



for the first time by Bennett and Wiesner, who exploited it to construct a protocol known as 
dense-coding [6]. The moral of dense coding is that the use of entanglement with an ancillary 
system can improve dramatically the discrimination of unknown transformations. Despite the 
extreme simplicity of the mathematics, this curious example points out structures that are much 
deeper than it might seem to the first sight. The aim of the present paper is to illustrate these 
structures in the general context of the estimation of an unknown group transformation. Our 
analysis will include dense coding, where the group of interest is the Klein group Z2 x Z2. 

2. General problem: estimation of an unknown group transformation 

2.1. The problem 

Suppose that we have at disposal one use of a black box and that we want to identify the action 
of the black box. Suppose also that we have some prior knowledge about the black box: in 
particular, we know that 

(i) the black box acts on a quantum system with finite dimensional Hilbert space H c± C^, d < 
00. 

(ii) it performs a deterministic transformation Ug belonging to a given representation of a given 
symmetry group G. 

Mathematically, a deterministic transformation (also known as quantum channel) is described 
by a completely positive trace-preserving linear map C acting on the set St(H) of quantum states 
(non- negative matrices with unit trace) on the Hilbert space H. The representation of the group 
G is then given a function U : g ^ Ug from G to the set of quantum channels, with the usual 
requirements 



where e G G is the identity element of the group and X-^ is the identity channel, given by 
^h{p) •= P) £ St('H). In this paper the group G will be always assumed to be either finite or 
compact. 

2.2. Estimation strategies without ancillary systems 

Since the group G and the representation {l/(g}gQQ are both known, the problem here is to identify 
the group element g E G that specifies the action of the black box. How can we accomplish this 
task? A first idea is to prepare the system in a suitable input state p G St{T-L) and to apply the 
unknown transformation Ug to it, thus obtaining the output state pg := Ug{p). The procedure 
can be represented diagrammatically as 



In this way, the output state pg will carry some information about g and we can try to extract this 

information with a quantum measurement and to produce an estimate 5 G G. The combination 
of the quantum measurement with our classical data processing can be described by a single 
mathematical object, namely a positive operator-valued measure {POVM, for short). 

Let us denote by Lin(7^) the set of linear operators on T-L and by Lin-|_(7^) C Lin(7^) the set 
of non-negative operators. If G is a finite group, a POVM with outcomes in G is just a function 
P : G — )■ Lin_|_('H) sending the element ^ G G to the non-negative operator Pg G Lin_|_('H) 
and satisfying the requirement J2geG Pg = ^H, where I-^ is the identity on H. The conditional 
probability of inferring g when the true value is g is then given by the Born rule ^(^l^) = Tr [P^pg]. 




V<7G G 
yg,heG 



In general, if G is a compact group and (t(G) is the collection of its measurable subsets [7], 
a POVM with outcomes in G is an operator-valued measure P : ct(G) — t- Lin-|-(7^) sending the 
measurable subset B G cr{G) to the non-negative operator Pg G Lin_|_('H) and satisfying the 
requirements 

• Pg = Ih 

. if 5 = U,=i Bi and {Bi}^^ are disjoint, then Pb = EZi 

The conditional probability that the estimate g lies in the subset B e a{G) is given by the Born 
rule piB\g) = TT[PBPg]. 

In the following I will frequently use the notation P{dg) to indicate the POVM P. 
Accordingly, I will also write p{dg\g) = TT[P{dg)pg] and Pb = J^P{dg). In the case of finite 
groups, the notation P{dg) will be understood as synonym of Pg and the integral over a subset 
B will be understood as a finite sum. The pair {p,P) of an input state and a POVM will be 
referred to as an estimation strategy. 



2.3. Estimation strategies with ancillary systems 

In the previous paragraph we discussed strategies where the unknown transformation was applied 
to a suitable state p G St(^). However, these strategies are not most general ones: we can 
introduce an ancillary system with Hilbert space /C, prepare a bipartite input state p G St('H(8)/C), 
and then apply the black box on system T-L, thus obtaining the output state pg := {Ug ®Ik){o'). 
The schematic of this procedure is 
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In this case, to obtain an estimate g of the unknown group element g we will use a POVM P 
on the tensor product Hilbert space H'SilC. 

Classically, we would not expect any improvement from the use of an ancillary system because 
this system will not carry any information about the unknown transformation. However, here the 
state p can be an entangled state, that is, a state that cannot be written as a convex combination 
of tensor product states. As the example of dense coding teaches, the use of entanglement can 
improve the estimation dramatically. Note, however, that mathematically every estimation 
strategy with ancillary system can be reduced to the form of an estimation strategy without 



ancillary system by choosing H' := H i 



I /C and Ug 



Unfile- 



2.4- Cost of an estimation strategy 

To quantify how far is the guess g from the true value one typically introduces a cost function 
c : G X G — >■ M, (^, 5) !->■ c{g,g) which takes its minimum value when g = g. If the true value is 
g, the expected cost of the estimation strategy (p, P) is 

c{p,P\g)-= c{g,g)p{dg\g) = c{g, g)Tr[P{dg)pg]. 
JG JG 

Ideally, we would like to minimize the cost simultaneously for every g. However, in general this 
is not possible because the number of states in the orbit {pg}geG is larger than the dimension 
of the Hilbert space and therefore there is no way in which the states {pg}g(zc can be perfectly 
distinguishable. 

Since the true value 5 G G is unknown, one approach to optimization is to minimize the 
worst- case cost of the strategy, defined as 



Cwc{p,P) = ^aj<Lc{p,P\g). 



(1) 



An alternative (and less conservative) approach consists in assuming a prior distribution Tr{dg) 
for the true value g £ G and in minimizing the average cost. Given that g is completely unknown, 
a natural choice of prior is the Haar measure d^, normalized as d^i = 1. With this choice, the 
average cost is given by 

Caveip,P):= J^dgcip,P\g). (2) 

For finite groups we can understand the integral as a finite sum and d^ as synonymous of 1/|G|, 
where |G| is the cardinality of the group G. 

In general, the worst-case and the average approach are very different: for example, the 
functional CaveiPiP) is linear in both arguments while the functional Cu,c{p,P) is not. The two 
approaches may lead to very different optimal strategics: for example, since Caveip^ P^ is linear 
in p the optimal input state can be searched without loss of generality in the set of pure states 
p = \ifi){ip\, while this reduction is generally not possible for the minimization of Cy,c{p-,P)- 
However, when the cost function enjoys suitable group symmetries it is possible to show that 
there is a strategy (p, P) that is optimal for both the worst-case and the average approach. This 
point will be illustrated in Section 4. 

3. Basic notions and notations 

3.1. Bipartite states 

Let {\4>i)}'i=i (resp. i) be a fixed orthonormal bases for the Hilbert space T-L (resp. /C). 

The choice of orthonormal bases induces a bijective correspondence between linear operators 
from fC to % and bipartite vectors in C?) /C. Following the "double- ket notation" of Ref. [8], if 
A G Lin(/C,^) is a linear operator from /C to "H we define the bipartite vector |^)) G ^ ® /C as 

■.= Y.Y.^<P^\A\i^J) \<t>rm. (3) 
i=\ 3=1 

Two useful properties of this correspondence are given by the equations 

{{A\B)) = Tt[A^B] VA-B G Lin(/C,H) (4) 

\A)) = {A®I^)\Ik.)) 

= {Iu®A')\Ih)) yAeL\o{)C,H), (5) 

where denotes the transpose of A with respect to the fixed bases. Using the singular value 
decomposition of ^4, it is easy to see that the state |^)) is of the product form \A)) = \(p)\tp) if 
and only if A is rank-one. Moreover, if the dimension of /C is larger than d-^, then for every 
state 1^)) there exists a d^^-dimensional subspace }Ca such that |^)) £ ICa- This is clear 
from Eq. (5): the dimension of the image of AJ- cannot exceed the dimension of its domain %. 

3.2. Group representations 

The unknown transformation in our estimation problem belongs to a group representation 
{Ug}g^Q, where each map Ug is a completely positive trace preserving map sending operators 
in Lin(^) to operators in Lin('H). Now, the condition Ug-iUg = X-^ implies that the map Ug 

must have the form Ug{p) = UgpUg, where Ug G L\n{'H) is a unitary matrix. In particular, 
foi g = e we can choose Ue = In- Moreover, the condition UgU^ = hlgh,\/g,h G G implies 
that the unitaries {Ug}g^Q, define a projective unitary representation, that is, a function 
U : g ^ Ug sending elements of the group to unitary operators and satisfying the relation 



UgUfi = uj{g, h)Ugfi,yg, h E G, where uj{g, /i) G C is a multiplier, satisfying the properties 



\^{9,h)\ =1 \/g,he G 

u}{g, h)uj{gh, k) = Lo(g, hk)uj{h, k) \/g, h,k & G 

u{g,e) =uj{e,g) = 1 ^g & G. 

Since the group G is compact and the space H is finite dimensional, with a suitable choice of 
basis the Hilbert space can be decomposed as 

^=0 ^/.®c'"^ (6) 

fie\rr{U) 

where the sum runs over the set lrr(f/) of all irreducible representations contained in the isotypic 
decomposition of U, "H^^ is a representation space of dimension d^, carrying the irreducible 
representation W^, and C"*'' is a multiplicity space, where is the multiplicity of the irreducible 
representation in the decomposition of U. Accordingly, the representation U can be written 
in the block diagonal form 

[/= U^^^Im,. (7) 
Melrr(!7) 

Note that all irreducible representations U^^ must be projective unitary representations and must 
have the same multiplier u. 

In the optimization of the estimation strategy (p, P) we can take the multiplicities as large 
as we want. Indeed, introducing an ancillary system /C means replacing the Hilbert space 7i with 
the tensor product H' := ^ /C and replacing the unitary Ug with the unitary U'g := Ug (g) Ijc for 
every g £ G. In Eqs. (6) and (7) this means replacing the multiplicity space C™'' with C"^^ ®/C 
and the multiplicity with := m^dx,- In particular, we are free to choose the ancillary 
system /C so that the condition > d^ is met for every jj, G \rr{U). 

3.3. Bipartite states and group representations 

Here we combine the facts observed in the paragraphs 3.1 and 3.2. Let us consider a vector 
1^') G H in a Hilbert space where the representation U acts. Using the isotypic decomposition 
of Eq. (6) and the correspondence of Eq. (3) for each tensor product T-L^ (g) C"**" , we can write 

^l&\rr{U) ^l&\rr{U) 

where ^ ^ is a linear operator in Lin(C'"^,'Hy^). In other words, the vector \^) can be written as 
a linear superposition of bipartite vectors. As we will see in the next sections, the fact that we 
can have entanglement between each representation space 'H^ and the corresponding multiplicity 
space C'"'' is the key ingredient to the construction of the optimal estimation strategies. 

Suppose that U has been chosen so that > d^ for every fi G \rr{U). From the last 
observation of paragraph 3.1 we know that each vector 1^'^^)) will be contained in a subspace 
nf,0lC^, where /C^ ~ C^'' ~ is a suitable subspace of C"''. Therefore, every vector 
l^*) belongs to a suitable subspace Ti ~ ©/ieirr((7) ^ ^m- Note that the whole orbit 
ll^p) := Ug\^)}g^Q belongs to the subspace H. In conclusion, as long as we are interested 
in pure input states we can effectively replace T-L with the space 7i := ©^girr(;7) ® where 
"^/i = c^/i for every G \n{U) and we can replace the representation U with the representation 
U defined by Ug := 0^gi|.r((7) Ug (8) Ifj,, where denotes the identity on H/j,. 



3.4- The class states _ 

An important family of states inH = ®n^\rr(u) family of states of the form 

m= ^la, c,ec, ^ \c,\' = i (8) 

/ielrr(l/) V '^A' /ielrr(f/) 

These states are linear superpositions of maximally entangled states in the sectors Ti/^ Ti/j,. 
In the following I will call these states class states. The reason for the name comes from the 
fact that these states correspond to class functions (see definition below) through the Fourier- 
Plancherel theory (see e.g. [9]). The remaining part of this paragraph is aimed at making this 
correspondence clear. 

Let us start from Fourier-Plancherel theory. Denote by lrr(G,a;) the set of all irreducible 
representations of G with multiplier and denote by M^- ((jf) := {(i,i\Ug\ix,j) the matrix element of 

the operator Ug with respect to a fixed orthonormal basis for the representation space 

"H^. Recall that the functions {i2^- {g) := -yd^iii^*((7)}^eirr(G,w);i j=i,...,dp are an orthonormal basis 
for the Hilbert space L2(G,dg') endowed with the scalar product (/i,/2) := /g /i (5)72(5) • 
Using this fact, we can expand every function / G L2{Q,dg) as 

i/)= © E/^K*)- (9) 

/ielrr(G,w) «,j=l 

On the Hilbert space L2(G, d(7) we consider the action of G given by the left-regular representation 
T with multiplier a;, defined by {Tgf){h) := iu{g, g^^h)f(g^^h), V/ G L2{Q,dg),\lg,h G G. [10]. 
In particular, the transformation of the functions u^* (g) is given by 

TM;) = E<^(9)\<P^ (10) 

k=l 

that is, for every fixed value of j the vectors {\uf*)}f!li span an invariant subspace carrying the 
irreducible representation C/^. It is then easy to see that the unitary operator V : L2(G,dg) — )■ 
®^.e\n{G,u,) "^M <^ ^/^ defined by 

y|<;) := V/xGlrr(G,Cc;),Vi,j = l,...,d^ (11) 

intertwines the two representations Tg and Wg = ©^girr(G uj) ^g^^iJ.'- indeed, using Eqs. (10) and 
(11) one has VTg\u'^*) = U^\fi,i)\n,j) = {Ul^0l^)V\u'^*) for every G lrr(G,c<;), Vi,i = l,...,d^, 
and therefore VTg = WgV. Applying the unitary V to the expansion of Eq. (9) we then obtain 

dp 

Let us come now to class functions. A class function is a function that is constant on conjugacy 
classes, that is, f{hgh~^) = f{g) for every g,h e G. Any class function can be written as a linear 
combination of irreducible characters, namely f{g) = E^eirrCG) fiJ.xl.{g), where xliig) = T^Pg*]- 
Exploiting the correspondence given by the unitary V we obtain 

v\f)= ^la- 

/i6lrr(G,a;) V "/^ 



It is now clear that the class states of Eq. (8) are nothing but the projection of the class functions 

to the finite dimensional subspace % C 0^girr(G uj) ® '^ij-- 

Besides providing a justification for the choice of the name class states, the Fourier-Plan cherel 
theory also provides a hint of the fact that the class states are optimal for estimation. 
Indeed, if we take the scalar product of two states in the orbit {\^g) ■= Ug\^)} we obtain 

I 1 2 

= J2fxe\rr{u) '^T~XiJ.{9~^h). If Cfj, is chosen so that C/j, = Xd^ for some constant A G C 
we then obtain oc X]^£irr([/) di_iXn{9~^h), which is nothing but the truncated version of 

the Dirac delta S{g,h) = J2^e\rr{G u) df^Xfi{9~^h). This provides an heuristic argument for the 
optimality of the class states, although the choice of the coefficients {c^} to provide the "best" 
finite dimensional approximation of the Dirac delta will depend on the choice of the cost function 



3.5. Notations 

If y € Lin(?^) is a unitary matrix, the symbol V will always denote the completely positive 
map V{p) = VpV^. Let C : Lin(H) Lin(/C) be a completely positive map and let 
Cip) = ELiC'ipCj, d G Lin(H,/C) be a Kraus form for C. The notation C^,C*,C'^ will be 
used for the following maps 

r 
i=l 

c*{p) ■.= j2cipcl 

i=l 

c^ip) :=j2crpct = {nHp), 

i=l 



where the complex conjugation * and the transpose T are defined with respect to the fixed bases 
for y, and JC. Note that the map is the adjoint of C with respect to the Hilbert-Schmidt scalar 
product {{A\B)) = Tt:[A^ B], A, B, e Lin(-H): indeed, we have {{A\C{B)) = {{C^A)\B)).) Note also 
that the definition of the maps C\C*,C'^ does not depend on the choice of a particular Kraus 
form. 

Consider the Hilbert space H = ®n^\„{u) ^/i®^/*' representation U = ®n^\„{u) ^^^^iii 
and let 



U'=\ I^®B^\B^eUn{n^,) 

./"6lrr(C/) 



be the commutant and the bicommutant of U , respectively. Every class state in Eq. (8) then 
defines a modular conjugation: for every operator A = 0^girr({/) A^iSi & U" I will denote by 
A^ G U' its modular conjugate, given by 



(12) 



Every class state |$) G H then enjoys the symmetry 

UgU^\^) = \^) V^gG, (13) 

as it can be easily verified using Eq. (5) for every /j, G \rr{U). The modular conjugate 
representation will be denoted by U^. For a completely positive map C{p) = Yll=i^iP^i 
with Ci G U" for every i = l,...,r we will denote by the completely positive map 
C^(p) ■= C^pCf^- Note that in general we have 

C(|$)($|)=C«t(|$)($|). (14) 

Introducing the swap map S, defined by 5 (^0^£irr([/) ® -^m) ®ne\rr{u) ^n^^n^ we obtain 
the relations 

A = S {A^*) \/A e U" 

r 

C = SC^*S W :C{p) = ^CipCl CiEU" yi = l,...,r 

i=l 

4. Optimal estimation 

4.1. Left-invariant cost functions: optimality of covariant measurements 
In this paragraph I will shortly review two classic results about the structure of covariant 
measurements and about their optimality. The proofs of these results can be found in the 
monographs [11, 12, 13]. 

Let us start from the definition: 

Definition 1 A POVM P : cr(G) — > Lin+(^) is covariant with respect to the representation 

{UgjgeG ifPgB = Ug{PB) e G,V5 G (7(G). 

Covariant POVMs have a very simple structure: 

Theorem 1 (Structure of covariant POVMs) Let G he a compact group, U he a projective 
unitary representation of G on the Hilhert space H. A POVM P : (t(G) — >■ Lin_|_('H) is covariant 
with respect to U if and only if it has the form 

P{dg)=Ug{0d9 (17) 

where dg is the normalized Haar measure and ^ G Lin+(H) is a suitable operator, called the seed 
of the covariant PO VM. 

The special form of covariant POVMs provides a great simplification in optimization 
problems, because it reduces the optimization of the whole POVM to the optimization of a 
single non-negative operator ^ G Lin+(^). Luckily, this simplification can be done without loss 
of generality in most situations. Precisely, covariant measurement are optimal whenever the cost 
function c{g,g) is left-invariant, that is c{hg,hg) = c{g,gyig,h G G. 

Theorem 2 (Optimality of covariant POVMs) Let G he a compact group, {?7g}c,eG a 
unitary projective representation of G on the Hilhert space % and c{g, g) he a left-invariant cost 
function. Then, for every input state p G St(^) the optimal POVM P for both the worst-case 
and uniform average approach can he assumed without loss of generality to be covariant. With 
this choice one has Cave{p,P) = Cwdp^P) = c(/9, -P|fl')Vg' G G. 



(15) 
(16) 



4-2. Right-invariant cost functions: optimality of class states 

In this paragraph I illustrate with a new proof a recent result on the optimality of class states 
[14]. This result is dual to the classic result on the optimality of covariant POVMs showed in 
the previous paragraph. 

A central feature of quantum theory is the validity of the purification principle [15]: for 
every mixed state p G St{T-L) there exists a Hilbcrt space /C and a pure state I'l')) ^ % ® IC 
such that p = Tryc[|^))((^'|] The pure state ^ is referred to as a purification of p. From Eq. 
(5) we have p = and Supp(p) = Rng(^'), where Supp and Rng denote the support and the 
range, respectively. The simplest example of purification of a state p G St(H) is the square-root 
purification |p2 )) G ^ (g) 71', where Ti' ~ Ti. 

A large number of quantum features, such as teleportation and no cloning, are simple 
consequences of the purification principle [15]. The structure of the optimal states for the 
estimation of group parameters is no exception to that. In the following I will give a simple proof 
of the optimality of the states in Eq. (8) based on the purification principle. To this purpose I 
will use a remarkable consequence of purification, namely the fact, first noticed by Schrodinger 
[16], that every ensemble decomposition of a state can be induced from the purification via a 
quantum measurement on the purifying system: 

Lemma 1 Let {X,a{X)) be a measurable space and let p{dx) : cr{X) — Lin_|_(^),i3 i— > ps be 
an ensemble, that is, an operator-valued measure such that px G St(^). // |^')) £ Ti <Si JC is a 
purification of px then there exists a POVM P : a{X) -> Lin+(/C),B G a{X) Pb such that 

PB = t^k[{Ih ® PB)mm] ys g a{x). (is) 

Proof. Let be the inverse of * on its support and define the POVM P via Pb '■= 

[^^^ Pb{^^)~^] ■ The POVM P provides a resolution of the projector on the range of and 
can be easily extended to a resolution of the identity on /C. Using the identity ^'^'"^ = -Psupp(px) 
where nsupp(p^) is the projector on the support of px we have 

= Tr^4(|^'^'-VB))((1'^-'|] 

= TrH'[|nsupp(px)/'B))((nsupp(px)] 

= pB- 

■ 

Suppose that the cost function c{g,g) is right-invariant, that is c{gh,gh) = c{g,g),\/g,g,h G 
G. Then we have the following 

Theorem 3 (Optimality of the purification of invariant states) Let Q be a compact 
group, U be a projective unitary representation of G on the Hilbert space H and c{g,g) be a 
right-invariant cost function. Then, the optimal input state for both the worst-case and uniform 
average approach can be assumed without loss of generality to be the purification of an invariant 
state p E U' . Denoting by l'^) ^T-L® C such a purification, there exists an optimal POVM P on 
n® C such that Cave{^,P) = c«,c(*,-P) = c(*,P|c/),Vc/ G G. 

Proof. Suppose that a G St('H (g) /C) is an optimal input state and that Q{dg) is an optimal 
POVM for the average approach (resp. for the worst case approach). We now prove that there 
is a Hilbert space C and another state |'I') G Ti <S> C that is the purification of an invariant 
state p & U' and is optimal as well. Consider the orbit {a/j := {Ufi l!c)ic)}h&G and the 
operator-valued measure a{dh) = ahdh. Clearly the state aQ = Jq dh ah is invariant under 
U (g) I)c- Let I*) G H (g) /C (g) (g) /C', with H' 2± H,/C' 2± /C, be the square-root purification 



of gq and define C := IC ® %' ® K.' . Note that |*) G T-L ® C \s also a purification of the 
invariant state p := Trjc[(7Q] G U'. We now have to show that |^') is optimal for the uniform 
average (resp. worst-case) approach. By lemma 1, we know that there is a POVM Mfidh 
on n' O IC' such that ph = Trn'(S>lC'[{In <S) Ik ® M/j)|^)(^|] V/i G G. Define now the POVM 
P(d^) := Jq dh Q{d{gh)) (g) M^. It is easy to see that the average cost of the strategy P) is 
equal to the average cost of the strategy {a, Q) : 

W(^,^')= f dg f cig,g) {{^g\P{dg)\^g)) 
JG JG 

= ^d<7^d/i^ c{g,g) {{^\{UlQ{d{gh))Ug^Mhm) 

= j^^hj^dgj^ c{g,g) T:t[UlQ{d{gh))Ugah] 

= I dh I d{gh) I c{gh,gh) TT[Q{d{gh))cT9h] 
J G J G J G 

having used the right-invariance of the Haar measure and of the cost function. On the other 
hand, the worst-case cost of the strategy P) cannot be larger than the worst-case cost of the 
strategy (a, Q): 

c^c{^,P) = max / c{g,g) { ^g\P{dg)\^g) 

= ma^ f dh f cig,g) { ^\{UlQ{d{gh))Ug Mh)\^') 

ffeG Jg Jq 

= m^ / dh f c{g,g) Ti[UlQ{d{gh))Ugah] 

ffeG Jq Jq 

= max dh c{gh,gh) Tr[Q{d{gh))agh] 

SeG Jq Jq 

= max / dh c{a,Q\gh) 

< / d/imax c{a,Q\gh) 
Jh aeG 

Hence, we proved that, both for the average and for the worst case approach, whichever the 
optimal strategy (a, Q) is, the strategy ('I', P) cannot be worse. Finally, we observe that for the 
strategy (*,P) we have c{^,P\g) = /cd/i^ c{gh,gh) Tv[Q{d{gh))agh] = Jf^dh c{a,Q\h). This 
proves that c{^^P\g) is independent of G G, whence c(^,P|^) = Cave{^-,P) = Cwc{^,P)- ^ 

Theorem 4 (Optimality of the class states) Let G be a compact group, U be a unitary 

projective representation of G on the Hilhert space % satisfying the property < m^V/i G lrr([7) 
and c{g,g) be a right-invariant cost function. Then, the optimal input state for both the worst- 
case and uniform average approach can be assumed without loss of generality to be a class state 
of the form 

l*)= \/fl^'^))^^= © n^^n^.cn, (i9) 

where Pn>0 and ^ = 1. 



Proof. By theorem 3, we know that there exists an optimal state that is the purification of an 
invariant state p G U' . Now the form of an invariant state is p = ©^girr([/) -^m Pij-i with 
Pn G St(C"*''). The square-root purification of p is then given by 

Here the first copy of is the representation space, while the remaining spaces contribute 

to the multiplicity. The action of the representation {Ug ® ly^g^Q then generates the orbit 
I — 1 

1*9) ■= ©Meirr(!7) V 1^ Pg))\pfi))- This orbit can be obtained from the orbit \^g) := 
©M6irr(t/) ^jf, M)) ^ H by applying the isometry V : e^eirr(c/,) n^^H^^ e^6irr(c/) ® 
® C^^" ® C"»M defined by F(©^6i,,([;) 1^;^)) = ®^^wr{u) \^i^)\ph- This proves that the state 
I*) = ©Meirr(c/) Y^I-^m)) is an optimal input state. ■ 

4.3. Invariant cost functions and isotropic states: optimality of isotropic seeds 
Suppose that the cost function c{g,g) is invariant, that is, both left- and right-invariant. 
Equivalently, this means that c{g,g) = f{g~^g) where / is a class function. Prom the discussion 
of the previous paragraphs we already know that i) the optimal input state will be class state 
and ii) the optimal POVM will be covariant. In fact, in this case the optimal POVM has an 
additional symmetry that reflects the symmetries of class states (see Eq. (13)). Furthermore, 
this additional property appears not only for class states but also for a more general family of 
input states that we call isotropic. 

Definition 2 (Isotropic operators) An operator A G Lin('H) with H = 0^girr([/) ^ T^ti 
isotropic if 

UgU^iA) = A V5 G G. 

The set of isotropic states (i.e. non-negative isotropic operators with unit trace) is convex. 
Clearly, every class state is isotropic. As a consequence, every mixture of class states is an 
isotropic state. 

Theorem 5 (Optimality of isotropic seeds) Let G be a compact group, U be a projective 
unitary representation of G on the Hilbert space H = ©jugirr(i7) ^a*' ^(^,(7) be an invariant 
cost function and p G St{H) be an isotropic state. Then, the seed ^ G Lin+(^) of the optimal 
covariant measurement for the input state p is isotropic. 

Proof. Since the cost function is left-invariant, the optimal POVM can be assumed to 
be covariant (theorem 2) and there is no difference between the average and worst case 
approaches. Moreover, for any covariant POVM P{dg) = Ug{^)dg we can define another 
covariant POVM P'{dg) = Ug{^')dg with ^ where 4' is isotropic. Indeed, it is enough to choose 
^' '■— Ig ^h^h iO- Note that ^'(d^) is still a resolution of the identity. The uniform average 
cost of the strategy (p, P') is the same of the strategy (p, P): 

caveip, P') = j^^a dff c{g, g) Tr[Ug{^')Ug{p)] 

= f dg f dg f dhc{g,g)lh:[Ugh{mgU^\p)\ 

= I dh I dg j dg c{gh,gh)Ti[Ugh{m9h{p)] 
1/ G G " G 



having used the fact that p is isotropic (and hence Uy^ \p) = Uh{p)) and the right-invariance of 
the cost function. ■ 

An example of isotropic seed is given by the class seed ^ = where \r]) eT-L is the class 

vector 

l^)= Vd'.m- (20) 

We will now see that this seed plays a special role: any isotropic seed can be obtained from it 
by applying a suitable physical transformation. 

Theorem 6 G Lin+('H) is an isotropic seed if and only if ^ = C{\^){ri\) for some completely 
positive map C : Lin(^) — )• Lin(^) that is bistochastic (i.e. trace-preserving and unit-preserving) 
and leaves all elements in the algebra U' invariant. 

Proof. Let us write $, = Yli=i and \r]i) = 0^eirr([/) V^I^^m))- Defining the operators 

Ci := ©^(=||.|.([/) ^i^®^^ £ U" and defining the completely positive map C as C{p) := Yll=i ^iP^l 
we then obtain ^ = C{\ri){ri\). Clearly, one has C{A) = A for every A € U' . The requirement 
that C be bistochastic is equivalent to the requirement that be an isotropic seed. First, we can 
prove that C is unital: 

C{I) = J^dg CU,{\rjm = l^dg CU^Wnm 

= J^dg ZYf C(|r/)(r;|) = J^dg ZYf (0 = J^dg U,{i) 

To prove that C is trace-preserving, wc cquivalcntly prove that is unital 
immediate to obtain C^^(/) = /: indeed, we have 

C«t(7)= f dgC^%mrj\)= I dg U^C^"^ {\r{) {r^\) 

JQ Jg 

= [ dgUgC{\v){v\)= [ dgUg{0 = I 
JG JG 

Finally, it is enough to observe that = S [C^"^)* S (see Eq. (15)) and, therefore, C^(7) = /.■ 
Corollary 1 Let ^ G Lin(^) be an isotropic seed, written as ^ = = 
©/.eirr(c/) VdHhn))- Then we have ELi = ELi = lu- 

lu the following the covariant POVM -^(d^) = Ug{\rj){rj\)dg generated by the class vector \rf) 
in Eq. (20) will be called class POVM. 

Corollary 2 Any covariant POVM P{dg) = Ug{^) with isotropic seed can be obtained from the 
class POVM E{dg) = Ug{\r]) {r]\)dg by application of a bistochastic map 

Note that the bistochastic map C^^ is covariant, that is UgC^^ = C^^Ug'ig G G. 

Corollary 3 Let p G St(^) be an isotropic state and P{dg) = Ug{Q be a covariant POVM with 
isotropic seed. Then, the estimation strategy {p, P) has the same cost of the estimation strategy 
{C^\p),E), where E is the class POVM E{dg) =Ug{\ri){r)\)dg. 

The question now is: why should we perform a physical transformation on the input state p 
before performing the measurement? Intuitively, one might expect that the application of the 
transformation C^^ can only degrade the information contained in the signal states pg := Ug{p). 
In the next paragraph I will show that the intuition is correct in the case where p is a class state 
and the cost function c{g,g) has negative Fourier transform. 



= I 

. Using Eq. (14) it is 



4-4- Invariant cost functions with negative Fourier transform: optimality of square-root 

measurements 

Here we will confine our attention to a special family of invariant cost functions, that is, to cost 
functions of the form 

c{9,g)= Yl aa<0 VaGUIrr(G). (21) 

o-eUlrr(G) 

where Ulrr(G) denotes the set of all unitary irreducible representations of G (the complex 
conjugate of the character has been introduced just for later convenience). Optimization 
problems with cost functions in this family are the maximization of the entanglement fidelity, 
corresponding to the cost function given by c{g,g) = -1 TT[Vi,-ig]/n\^, where V : G Lin(C") 
is a unitary representation of G, and the maximum likelihood [17], corresponding to the cost 
function c{g,g) = -S{g-^g). 

Incidentally, we observe that each cost function of the form in Eq. (21) defines a positive 
semidefinite product on the vector space of class functions, given by 

(/i,/2)c [dg [ dgc{g,g)fl{g)f2{g) 

(TeUlrr(G) 

The next theorem summarizes the structures presented so far: 

Theorem 7 Optimal estimation strategy for invariant functions with negative 
Fourier transform. Let G be a compact group, U be a unitary projective representation of G on 

the Hilhert space % = ©^girr(;7) T^f^'^T^f^ ^i^'d c{g, g) be an invariant cost function of the form of 
Eq. (21). Then, the class POVM E{dg) = Ug{\ri) {ri\)dg is optimal for every class state |^>) of the 
form 1$) = 0^g|rr(c/) ^/piJd^\If^))- The cost is given by Cave = EaeUlrr(G) 0^"!^ V^^IpiJ, where 
nia'^ is the multiplicity of the irreducible representation U" in the tensor product <^ U^* . 

Proof. By theorem 5 we can assume without loss of generality that the optimal POVM is 
covariant, i.e. P{dg) = Ug{^)dg, and that the seed ^ is isotropic. Writing the seed as 
C = E[=i with |?7) = 0^ei,,([/) ^/d^\v^u.)) we obtain for the cost 



ciP,<^\e) = J^dgcig,emUgiO\^) 

z rr:^ „ „^,..tTn V ^G 



o-eUlrr(G) iJ.,ue\rr{U) 



= E E «<^v&E/d5x;(5) ((/Mi(t/^c5/M)M)((^^-i(<®^-)i^4 

(T6Ulrr(G) ;i,i/elrr(!7) 1=1-^^ 
<TeUlrr(G)At,i^6lrr({/) 1=1-^^ 



ffeUlrr(G) /i,i/elrr(!7) i=l 



E E ^^f^E^[nr(^^M^C)]' (22) 

o-eUlrr(G) iJ.,ve\rr{U) i=l 



where P^^ is the projector on the direct sum of all irreducible subspaces of ® Hi, that carry 
the irreducible representation U"' as a subrepresentation of the tensor product (g) V^* . Since 
Oo- < for every a, we have the bound 



:(P,$|e)< E 



creUlrr(G) ^l,v<^\rr{U) 



<TeUlrr(G) ix,ve\rr{U) 



r 



o-eUlrr(G) |U,J'elrr(C/) \ i=l 



3=1 



having used the Cauchy-Schwarz inequality | Tr[^^^i?]| < y^Tr[^t^] Tr[iJti5] for block matrices 
A = 0[=]^ Ai and B = 0[=i Sj. Finally, using the normalization conditions of corollary 1 we 
obtain 



c(P,$,e)< E ^^^f^Tr[nr]< E E -'rmrVP^., 

(TeUlrr(G) ^,j/elrr((7) o-eUlrr(G) //,i/elrr(U") 

where rua'^ is the multiplicity of in the tensor product of If^* . On the other hand, by 

substituting in Eq. (22) it is immediate to see that the class POVM E{dg) = l^g{\'r]){r]\) achieves 
the bound and hence is the best possible POVM. ■ 

The optimality of the class POVM can be rephrased as a result about the optimality of 
square-root measurements [18] (see also [13] for a surprising ante litteram example). The general 
definition of square-root measurement that wc will consider here is the following: 

Definition 3 Let {X,a(X)) be a measurable space and let p{dx) : a(X) — )• Lin_|_('H), S i-)- pB 

be an ensemble. The square-root measurement associated to p is the POVM P : a{B) i->- 

_i _i _i 1 

Lin+(Supp(/9x)) given by Pb '■= px^ PbPx^ ' where p-^ is the inverse of p^ on its support. 

It is easy to see that the class POVM E{dg) = Ug{\ri){ri\)dg is the square-root measurement 
associated to the ensemble ^(d^) := \^g){^g\dg where = Ug\^) for some class state 

I*) = ©M6irr(C/) Indeed, we have pG = J^dg \^g){^g\ = 0^gi,r(c/) ^ ^nd 

therefore p^^l^g) = ^^gPc'l^) = ^9 (S^eirr(c/) V^I^m))) = ^sl^)' "^^^^^ means E{dg) = 
_l _ i 

A slight generalization of Theorem 7 is then given by the following: 

Corollary 4 Let G be a compact group, U be a projective unitary representation of G on the 
Hubert space % = ®^^\rr{u) ^{9^9) invariant cost function of the form of 

Eq. (21). Then, the square-root measurement is optimal for every state \"^) £ 7i of the form 
1^') = V\^) where \^) E T-L is a class state as in Eq. (19) and V G Lin('H) is a unitary matrix 
in U'. 

Proof. The square-root measurement associated to the ensemble \^g){'^g\dg is Pidg) = 
UgV {\rj) {r}\) . It is easy to see that the cost of the strategy (^',-P) equal the cost of the strategy 



indeed, the conditional probabilities are the same: 
p'{dg\g) ■.= TT[P{dg)Uamm 

= i:,[UgV{\7i){7^\)Ugvmm'^g 
= TT[Ug{\ii){vmmmwg ■■= pmg) 

This means that the square-root measurement P{dg) must be optimal for l^*), otherwise E 
would not be optimal for |$). ■ 

5. Conclusions and open problems 

This paper provided a synthesis of the main group theoretical structures appearing in the 
estimation of unitary transformations on finite dimensional Hilbert spaces. I deliberately avoided 
any mention to infinite dimensions, in order to keep the exposition at the simplest possible level 
and to highlight the structures that underly optimal estimation in the neatest possible way. 
The development of a general theory of optimal estimation for infinite dimensional systems is 
mostly an open problem, especially in the case of non-compact groups, for which there are 
many particular results (especially for abelian groups) but still no unifying framework. The 
development of a general theory of estimation for non-abelian non-compact groups rests as a 
stimulating challenge for future research. 
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